Universal Access To All Knowledge
Home Wayback Machine | Archive-It | Blog | Heritrix
Search: Advanced Search
Anonymous User (login or join us) Upload

Most Downloaded Items
Last Week more

  1. FortuneCity Screenshots Collection
    378 downloads
  2. The Archive Team Geocities Snapshot (Part 1 of 8)
    301 downloads
  3. Fill item xnxx.com-20120121-070615 for xnxx.com
    71 downloads
  4. Egao YES Nude (Dance Shot)
    55 downloads
  5. Rihanna- Unfaithful
    51 downloads

Most Downloaded Items more

  1. The Archive Team Friendster Snapshot (000000000)
    7,874 downloads
  2. Encyclopedia Dramatica January 2010 Mirror
    1,596 downloads
  3. Archive Team: Audio and Video of Algathafi.org
    1,210 downloads
  4. The Archive Team Geocities Snapshot (Part 1 of 8)
    1,024 downloads
  5. Wikileaks bulk files
    977 downloads

Spotlight Item

The Archive Team Friendster Snapshot (000000000)
There is no description for this item

About the Internet Archive

Background

Frequently Asked Questions

439,845 itemsWelcome to Web Crawls

Web crawl data from various sources.

All items (most recently added first) - RSS

Sub-Collections

Accelovation Crawl
Crawl data from Accelovation. This data is currently not publicly accessible.
Alexa Crawls
Crawl data donated by Alexa Internet. This data is currently not publicly accessible.
34,385 items
Archive Team
Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake...
5,740 items
Archive-It Digital Collection
The Archive-It Digital Collection
2,330 items
archiveteam-mobileme
Description Forthcoming
302 items
Common Crawl
Web crawl data from Common Crawl.
Cuil Crawl Data
Crawl data from cuil.com.
2 items
Custom Crawl Services
National library harvesting.
16 items
Focused Crawls
Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
4,957 items
httparchive
Successful societies and institutions recognize the need to record their history - this provides a way to review the past, find explanations for current behavior, and spot emerging trends. In...
16 items
Institut national de l’audiovisuel
Crawl data from Institut national de l’audiovisuel in France. This data is currently not publicly accessible.
Internet Archive Web Crawls
Crawl data collected by the Internet Archive. This data is currently not publicly accessible in this format. To view archived web pages, please visit the Wayback Machine.
144,843 items
Internet Memory Foundation
Crawl data from Internet Memory Foundation. This data is currently not publicly accessible.
6 items
Mercator Crawl
Crawl done with the DEC/HP-labs 'Mercator' crawler and converted to ARC format. This data is currently not publicly accessible.
National Library of Australia Crawl
National Library of Austrailia crawl. This data is currently not publicly accessible.
2,187 items
Thumper Transfer
Web crawl data transferred from thumpers in Santa Clara data center.
urlteam Web Crawls
Crawl data collected by the urlteam.
4 items
web-group-internal
miscellaneous data
1,898 items
Wikileaks.org Archive
A collection of web pages from the wikileaks websites as well as news coverage and commentary surrounding the Wikileaks releases. It includes coverage of the Afghan war diaries, the Iraq war logs,...
3 items
Wikimedia Downloads
All downloads provided by the Wikimedia Foundation are available in this collection. Most of the files here originate from their designated download server. What is available? Wikimedia projects'...
23,089 items
Wikimedia Foundation Media
Media files from the Wikimedia Foundation.
8 items
Wikipedia Dumps
Data dumps of the wikipedia.org web site.
2,441 items
WikiTeam
WikiTeam software is a set of tools for archiving wikis. They work on MediaWiki wikis, but we want to expand to other wiki engines. As of April 2012, WikiTeam has preserved more than 500 wikis. ...
112 items

Recently Reviewed Items (more)

Archiveteam Splinder Save: 00000017
Average rating:

ShoutWiki wikifarm dump
Average rating:5.00 out of 5 stars5.00 out of 5 stars5.00 out of 5 stars5.00 out of 5 stars5.00 out of 5 stars

WikiTeam Mirror
Average rating:

hindi-wikipedia-6/10/10
Average rating:3.00 out of 5 stars3.00 out of 5 stars3.00 out of 5 stars

Usenet Archive of UTZOO Tapes
Average rating:4.00 out of 5 stars4.00 out of 5 stars4.00 out of 5 stars4.00 out of 5 stars

This Just In (more)

YouTube Video Crawldata 2012-06-02T19:43:20PDT to 2012-06-02T13:51:01PDT
9 minutes ago

Wikimedia media incremental dump files for chywiki on 20120530
9 minutes ago

Webwide Crawldata 2012-06-02T18:59:57PDT to 2012-06-02T13:58:20PDT
11 minutes ago

Wikimedia media incremental dump files for chwiki on 20120530
12 minutes ago

Wikimedia media incremental dump files for chrwiktionary on 20120530
14 minutes ago


 

New PostWayback Machine Forum Subscribe to or unsubscribe from this forum RSS feed of most recent posts to this forum

Subject Poster Replies Date
Sprzątanie grobów Krzysztof2012 1 June 02, 2012 03:56:50am
   Re: Sprzątanie grobów Krzysztof2012 0 June 02, 2012 04:00:35am
Podręczniki Krzysztof2012 0 June 02, 2012 03:46:55am
Online clothing store USA lotottes 0 May 31, 2012 09:11:51pm
Online clothing store USA lotottes 0 May 31, 2012 09:11:01pm
útnyilvántartás útnyilvántartó program 0 May 29, 2012 08:15:39am
Today we talk of Cotton sifter pads flourmilling 0 May 29, 2012 01:18:03am
Types of Elevator bucket flourmilling 0 May 29, 2012 01:17:20am
Do you know the Sieve cleaner steps flourmilling 0 May 29, 2012 01:15:15am
Flour Milling History flourmilling 0 May 29, 2012 01:14:26am
Submit WARC Nemo_bis 0 May 27, 2012 03:29:02am
Remove images/flash bibliosoft 0 May 24, 2012 04:38:41am
Entrevista documento educomunicativo Gchacon 0 May 22, 2012 11:02:36am
Virtual Marketing - Vision with the latest in Techniques developmentindia 0 May 21, 2012 09:45:43pm
Importance of Web Design & Development Consultation developmentindia 0 May 21, 2012 09:42:53pm
I view my site's web history gizliilimler 1 May 16, 2012 03:05:31am
   Re: I view my site's web history Jeff Kaplan 0 May 16, 2012 04:14:57am
Teacher forum in french to be re-archived urgently, please ! spinoza1670 0 May 14, 2012 10:58:26pm
Videos of a site does not want to play! Rafael Moro Pereira 0 May 07, 2012 10:36:27am
Page can't be viewed even if in that time there was no robots.txt jack_mustang 0 May 06, 2012 01:24:38pm
Stealing Content from Wayback Archives ChrisJBrady 2 May 04, 2012 03:53:31am
   Re: Stealing Content from Wayback Archives jory2 0 May 03, 2012 01:21:12pm
   Re: Stealing Content from Wayback Archives alexsimon 2 May 04, 2012 12:40:08am
     Re: Stealing Content from Wayback Archives ChrisJBrady 1 May 04, 2012 03:58:16am
       Re: Stealing Content from Wayback Archives alexsimon 0 May 04, 2012 03:10:22am
     Re: Stealing Content from Wayback Archives ChrisJBrady 0 May 04, 2012 04:00:26am
Is Wayback Machine a malicious site crawling with viruses? angeldeb82 0 May 03, 2012 08:05:03pm
Movies work on iPad! Life Learner 0 May 02, 2012 06:20:16pm
Chumby.com is going away - Request for archiving. Steevow 1 May 01, 2012 08:18:29am
   Re: Chumby.com is going away - Request for archiving. Jeff Kaplan 1 May 01, 2012 01:27:19pm
     Re: Chumby.com is going away - Request for archiving. Steevow 1 May 01, 2012 06:35:39pm
       Re: Chumby.com is going away - Request for archiving. Hydriz 0 May 08, 2012 12:55:21am
"Eveready Harton in Buried Treasure" in Public Domain? santosrr 0 April 28, 2012 08:30:08am
Serbian elections 2012 arsa 0 April 26, 2012 03:29:29am
IP address of a webpage bucz 0 April 24, 2012 07:49:08am

View more forum posts
 

Terms of Use (10 Mar 2001)